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(54) System and method for protection of digital works 



(57) A method of protecting a digital work uses a 
blind transformation function to transform an encrypted 
digital work into encrypted presentation data. The orig- 
inator's digital content is protected in its original form by 
not being decrypted. This method enables the rendering 
or replay application to process the encrypted document 
into encrypted presentation data without decrypting it 



first. Encrypted presentation data is then decrypted just 
before it is displayed to the user. The blind transforma- 
tion function is a function of the original transformation 
function. For example, the blind transformation function 
may be a polynomial of the original transformation func- 
tion. Alternatively, both the blind transformation function 
and the original transformation function may be any mul- 
tivariate, integer coefficient affine function. 
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Description 

C pyrightN tice 

5 [0001 ] A portion of the disclosure of this patent document contains material which is subject to copyright protection. 
The copyright owner has no objection to the facsimile reproduction by anyone of the patent document or the patent 
disclosure as it appears in the Patent and Trademark Office file or records, but otherwise reserves all copyright rights 
whatsoever. 

10 Related Application 

[0002] This application is a continuation-in-part application of application no. 09/1 78,529 filed October 23, 1 998. 
Field of the Invention 

15 

[0003] The invention relates to document rights management, and more particularly, to a method for protecting digital 
works which employs a blind transformation to transform encrypted digital works into encrypted presentation data. 

Background of the Invention 

20 

[0004] One of the most important issues impeding the widespread distribution of digital documents or works via 
electronic commerce is the current lack of protection of the intellectual property rights of content owners during the 
distribution and use of those digital documents or works. Efforts to resolve this problem have been termed "Intellectual 
Property Rights Management" ("IPRrVT), "Digital Property Rights Management" ("DPRM"), "Intellectual Property Man- 
25 agement" ("IPM") t "Rights Management" CRM"), "Digital Rights Management" ("DRM") and "Electronic Copyright Man- 
agement" ("ECM"). At the core of Digital Rights Management is the underlying issue of ensuring that only authorized 
users may perform operations on digital documents or works that they have acquired. Once accessed, the content 
must not be distributed or used in violation of the content owner's specification of rights. 

[0005] A document or work, as the term is used herein, is any unit of information subject to distribution or transfer, 
30 including but not limited to correspondence, books, magazines, journals, newspapers, other papers, software, photo- 
graphs and other images, audio and video clips, and other multimedia presentations. A document may be embodied 
in printed form on paper, as digital data on a storage medium, or in any other known manner on a variety of media. A 
digital work, as the term is used herein, is any document, text, audio, multimedia or other type of work or portion thereof 
maintained in a digital form that can be replayed or rendered using a device or a software program. 
35 [0006] In the world of printed documents, a work created by an author is usually provided to a publisher, which 
formats and prints numerous copies of the work. The copies are then sent by a distributor to bookstores or other retail 
outlets, from which the copies are purchased by end users. 

[0007] While the low quality of copying and the high cost of distributing printed material have served as deterrents 
to the illegally copying of most printed documents, it is far too easy to copy, modify, and redistribute unprotected elec- 
40 tronic documents. Accordingly, some method of protecting electronic documents is necessary to make it harder to 
illegally copy them. This will serve as a deterrent to copying, even if it is still possible, for example, to make hardcopies 
of printed documents and duplicate them the old-fashioned way 

[0008] With printed documents, there is an additional step of digitizing the document before it can be redistributed 
electronically; this serves as a deterrent. Unfortunately, it has been widely recognized that there is no viable way to 
45 prevent people from making unauthorized distributions of electronic documents within current general-purpose com- 
puting and communications systems such as personal computers, workstations, and other devices connected over 

local area networks (LANs), intranets, and the Internet. Many attempts to provide hardware-based solutions to prevent 
unauthorized copying have proven to be unsuccessful. 

[OOpc* 1 Two basic schemes have been employed to attempt tc solve the document rrolecticn problem: secure cen- 
se i&inert systems which reiy on cryptographic mechanisms) ana trusted systems. 

[0010] Cryptographic mechanisms encrypt (or "encipher") documents that are then distributed and stored publicly, 
and ultimately privately decrypted by authorized users. Cryptographic mechanisms provide a basic form of protection 
during document delivery from a document distributor to an intended user over a public network, as well as during 
document storage on an insecure medium. Many digital rights management solutions rely on encrypting the digital 
55 work and distributing both the encrypted message and decryption key to the consumer's system. While different 
schemes are employed to hide the decryption key from the consumer, the fact remains that ail necessary information 
is available for a malicious user to defeat the protection of the digital work. Considering that current general-purpose 
computers and consumer operating systems provide little in the way of sophisticated security mechanisms, the threat 
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is both real and obvious. 

[0011] A "secure container" (or simply an encrypted document) offers a way to keep document contents encrypted 
until a set of authorization conditions are met and some copyright terms are honored (e.g., payment for use). After th 
various conditions and terms are verified with the document provider, the document is released to the user in cl ar 
5 form. CommerciaJ products such as IBM's Cryptolopes and InterTrust's Digiboxes fall into this category. Clearly, the 
secure container approach provides a solution to protecting the document during delivery over insecure channels, but 
does not provide any mechanism to prevent legitimate users from obtaining the clear document and then using and 
redistributing it in violation of content owners' intellectual property. 

[001 2] Cryptographic mechanisms and secure containers focus on protecting the digital work as it is being transferred 
10 to the authorized user/purchaser. However, a digital work must be protected throughout its use from malicious users 
and malicious software programs. Even if a user is a trusted individual the user's system may be susceptible to attack. 
A significant problem facing electronic commerce for digital works is ensuring that the work is protected on the targ t 
consumer's device. If the protection for the digital work is compromised, valuable and sensitive information is lost. To 
complicate matters, today's general-purpose computers and consumer operating systems are deficient in the areas of 
15 security and integrity. Protecting the work throughout usage is a much more complex issue that remains largely un- 
solved. 

[0013] In the "trusted system- approach, the entire system is responsible for preventing unauthorized use and dis- 
tribution of the document. Building a trusted system usually entails introducing new hardware such as a secure proc- 
essor, secure storage and secure rendering devices. This also requires that all software applications that run on trusted 
20 systems be certified to be trusted. While building tamper-proof trusted systems is still a real challenge to existing 
technologies, current market trends suggest that open and untrusted systems such as PC's and workstations will be 
the dominant systems used to access copyrighted documents. In this sense, existing computing environments such 
as PC s and workstations equipped with popular operating systems (e.g., Windows and UNIX) and render applications 
(e.g., Microsoft Word) are not trusted systems and cannot be made trusted without significantly altering their architec- 
ts tures. 

[001 4] Accordingly, although certain trusted components can be deployed, users must continue to rely upon various 
unknown and untrusted elements and systems. On such systems, even if they are expected to be secure, unanticipated 
bugs and weaknesses are frequently found and exploited. 

[0015] Conventional symmetric and asymmetric encryption methods treat messages to be encrypted as basically 
30 binary strings. Applying conventional encryption methods to documents has some drawbacks. Documents are typically 
relatively long messages; encrypting long messages can have a significant impact on the performance of any appli- 
cation that needs to decrypt the document prior to use. More importantly, documents are formatted messages that r ly 
on appropriate rendering applications to display, play, print and even edit them. Since encrypting a document generally 
destroys formatting information, most rendering applications require the document be decrypted into clear form befor 
35 rendering it. Decryption prior to rendering opens the possibility of disclosing the document in the clear after the d - 
cryption step to anyone who wants to intercept it. 

[0016] There are a number of issues in rights management: authentication, authorization, accounting, payment and 
financial clearing, rights specification, rights verification, rights enforcement, and document protection. Document pro- 
tection is a particularly important issue. After a user has honored the rights of the content owner and has been permitted 
40 to perform a particular operation with a document (e.g., print it, view it on-screen, play the music, or execute th 
software), the document is presumably in-the-clear, or unencrypted. Simply stated, the document protection problem 
is to prevent the content owner's rights from being compromised when the document is in its most vulnerable state: 
stored, in the clear, on a machine within the user's control. 

[0017] Even when a document is securely delivered (typically in encrypted form) from a distributor to the user, it must 
45 be rendered to a presentation data form before the user can view or otherwise manipulate the document. Accordingly, 
to achieve the highest level of protection,, it is important to protect the document contents as much as possible, while 
revealing them to the user at a late stage and in a form that is difficult to recover into a useful form. 
[0016] in the- known appiozci,*; u electronic cjocumeni rlinrif uiien met einrlo\ encryption, an encrypted document 
is rendered in several separate steps. First, the encrypted document is received by the user. Second, the user employs 
so his private key (in a public key cryptosystem) to decrypt the data and derive the document's clear content. Finally, the 
clear content is then passed on to a rendering application, which translates the computer-readable document into the 
finished document, either for viewing on the user's computer screen or for printing a hardcopy. The clear content is 
required for rendering because, in most cases, the rendering application is a third-party product (such as Microsoft 
Word or Adobe Acrobat Reader) that requires the input document to be in a specific format. It should be appreciated, 
55 then, that between the second and third steps, the previously protected document is vulnerable. It has been decrypted, 
but is still stored in clear electronic form on the user's computer. If the user is careless or is otherwise motivated to 
minimize fees, the document may be easily redistributed without acquiring the necessary permissions from the content 
owner. 
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10019] While no system is completely spoof proof or immune to attack, some recent techmques protect dgtal worts 
by limiting use of the digital work to a user-specified physical devic . Thes techmques requ.re the us rtoprovje 
^vriLtion or system state information from ft system or physcal device ft user .ntends to use to render 
the dtaital work. Syst m state information is typically defined as system configuration ^formation such as system 
naram^ters CPU identifier, device identifiers, NIC identifiers, drive configuration, etc. In these techniques, the d.g tal 
content is encrypted using a session key. then the session key. rather than using the user's encr^on key, .s encrypted 
usinq a eolation of the system or state information and the user* credentials. Then both the encrypted con en 

a misted authorizing entity (usually a remotely located software program) which venf.es the use^s ^ identrty and cre- 
dentials, then together with system state, decrypts the session key and finally decrypts the content for use ■ 
roS Commercial applications such as the secure Adobe Acrobat reader and the secure Microsott Med«Player 
SL usage of fted^L work by checking a license voucher for the appropriate user credent.als and 
Tmong the user credentials are system device identifiers such as the CPU identifier or certa.n device senal numbers. 
ZZ time the user invokes an operation on the digital work, the application verifies if the speofied dev.ce is present. 
" ^ p^fs Turance ftat the digital work has not been transmitted to an unauthorized user (actually to an unau- 
fto^T^rSte fte pmgrammatfc check provides a minimal level of assurance, it depends on the secumy of 

theTeSeS 

themselves are particularly susceptible to the threat of spoofing. 

roS The Acrobat Reader and MediaPlayer protection schemes operate by allowing the rendenng apP'^on to 
ItireZ^devices on theusersystem as specifiedin the license 

a teve! ^ protection adequate in many circumstances (i.e., if the user is tmsted and the user's speafied 1 rendenng 
device ^ noTsusceptibte to attack). The weakness of the schemes that it is based on the assumpt.cn that nerfter 
the protection of the cryptographic key nor the integrity of the license voucher will be compromised. 

Tnesetechniquesare^^ 
user's identity and credential information, system state information is verified or license voucher receded the ^content 
^ecryptedTortsdearstateandthenb^omesvulnerableto attack. The dig^lwork. 
out usage. Further, the user information approach is problematic in that it assumes the user 
from passing along his/her persona, information. In other words, for the user information app oacf Ho succeed there 
must be severe consequences for users who would reveal their private identity and credential ^formation. 

A significant lawback to the schemes which tie authorization a specific device .s that they requ.re the user 
o divulge sensitive information (e.g., CPU number or other personal information whch n""*^ 
privacy issues. While the user divulges the information voluntarily (the user's only opt.cn rf he/s hedoe. 

this information not to recede the digital work) it would be desirable to 
could secure adiqital work on a user's device without requiring private information. It would alsobedes.rableto provid 
l£S£Z£ST*m not rev on the protection of the cryptographic key or the integrity of the 
ft would be desirable to provide a DRM solution which delayed decryption of the d.grtal content to the latest possibl 

r0024i nt Accordingly it would be beneficial to provide an electronic document distribution scheme that minimizes th 

40 distributed document during the decryption and rendenng processes. 

Summary of the Invention 

r002Sl A self-protecting document ("SPD"). according to the invention, is not subject to the above-stated disadvan- 
« aSs of fte pro? fining a encrypted document with a set of permissions and an executable code ^segment 
SSesToX^ 

accomplishes protection of document contents without the need for additional hardware and software. 
[00267 The SPD system is broken down between a content creator (analogous to the author and the pubhsher of the 
rad la. modeh and a content dilute, The amhor/puMisher create, the- cripina, ^^^^ 

u rights are to be permittee 1 ne a.suiouior men customizes tne oocumen, ,o. use oy vanous users, ensuring v.„ the 
customization that the users do not exceed the permissions they purchased. „ m u„ ri i m «.nt 
%S Attheuser'ssystem.theserf-protectingdocument is decrypted at the last poss.b.e m °7f '"^ n e ^° n d '" 1 r ^ 
of invention, varioul rendering facilities are a.so provided within the SPD, so <M«^«» 
upon external appltoation that might not be trustworthy (and that m.ght .nvite «"«* 0 '« d "^J" ct wttlthe SPDto 

« bodiment, interfaces and protocols are specified for a third-party rendenng appl.cat.on to interact w.th the SPD to 

ST rre r e e mb e o2en» of the invention, the encrypted document is decrypted by the use* ; syste. 

taneously "polarizing" it withakey that is dependent, at leastin part, on the stateof the user's system. The polar.zat.on 
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may be cryptograph icafly less secure than the encryption used for distribution, but serves to deter casual copying. In 
this embodiment, depolarization is perform d during or after the rendering process, so as to caus any intermediat 
form of the document to be essentially unusable. 

[0029) In another embodiment of the invention, a method of protecting a digital work uses a blind transformation 
5 function to transform an encrypted digital work into encrypted presentation data. The originator's digital content is 
protected in its original form by not being decrypted. This method enables the rendering or replay application to process 
the encrypted document into encrypted presentation data without decrypting it first. Encrypted presentation data is 
then decrypted just before it is displayed to the user. This method improves the overall performance of the process 
(both decryption and rendering) by minimizing the decryption overhead (since pre-rendering decryption is generally 
10 more time and resource consuming) and postponing the decryption to a late stage of the rendering process. 

[0030] Blind transformation or blind computing can be accomplished in one of several ways. Most digital works includ 
formatting information, which when encrypted cannot be processed by the replay or rendering application (the trans- 
formation function which transforms a digital work into presentation data). If the digital work is encrypted with a format 
preserving encryption scheme, any transformation function may be used. This is particularly useful in that any corn- 
's mercial replay or rendering application can process the encrypted digital work into encrypted presentation data. Oth- 
erwise, the blind transformation function is a function of the original transformation function. For example, the blind 
transformation function may be a polynomial of the original transformation function. Alternatively, both the blind trans- 
formation function and the original transformation function may be any multivariate, integer coefficient affine function. 
[0031] Not all encryption schemes are format preserving encryption schemes. Additive encryption schemes may be 
20 used with ail document types and all associated transformation functions. In some replay or render applications, for 
some types of documents, portions of the format information may be left in the clear. In other types of documents all 
of the format information may be encrypted. In some types of documents, an additive encryption scheme may be used 
to encrypt the format information and any encryption scheme may be used to encrypt the content or data portion of 
the document. 

25 [0032] In particular, additive encryption schemes can be used to encrypt coordinate information of documents so 
that some rendering transformations can be performed on the encrypted coordinate data. In a special class of docu- 
ments, token-based documents, for example, there are two places during the format-preserving encryption that use 
encryption schemes: one is for coordinate or location information x and y of the particular tokens within the document, 
and the other is for the dictionary of individual token images. In order to perform blind transformation on the individual 

30 coordinates of the particular tokens in the document, the first encryption scheme must be an additive encryption 
scheme. However, the token dictionary may be encrypted with any encryption scheme. 

[0033] An encrypted token dictionary may still leak information such as the sizes of the token images. If this is a 
concern (such as if the token dictionary is small), the tokens can be padded with some extra bits before encryption. 
The padding can result in encrypted token images of a same size or several fixed sizes. For a token-based document, 

35 the coordinate information of the tokens in the dictionary may not be encoded. If it is desired that coordinate information 
be encoded, say, as Huffman codewords, the same approach that is used to encrypt the identifiers can be used to deal 
with this situation. Basically, the codewords in location tables are left in the clear, and the codewords in the codeword 
dictionary are hashed using some one-way hash function and their corresponding coordinate information is encrypt d. 
During rendering the codewords in the location tables are first hashed and then used to lookup their encrypted coor- 

40 dinate information. 

[0034] In another embodiment of the invention, a digital work and a system context (or resource information or system 
resource) are polarized enabling trusted rendering or replay of the digital work without depolarization of the digital 
content. In this embodiment, the digital work is of the type which includes digital content and resource information. 
Resource information may include information used by a replay application to format or process the digital work into 

45 presentation data. Resource information may include, for example, a collection of system resources available to the 
replay software on a particular system, such as the Font Table ; Color Palette, System Coordinates and Volume Setting. 
[0035] Different types of digital works may be polarized. In addition to polarizing typical document type digital works, 
t..ucu oi.c viae-c oic iic.t ivc \r.i cr.r. te pc i;.n;.ec\ 1 ir- aiciir-: worv t.r.c syfie-iv. »■ ;<\\\£v\ ;'iiisli\ polf.ri.tfcc £i £ mam 
f acturer or content owner's location using a polarization engine. A polarization engine is a component used to transform 

so the digital work and system context to their respective polarized forms. The polarization engine employs a polarization 
scheme which relies on some polarization seed, an element used to initialize and customize the polarization engine. 
[0036] Various polarization schemes may be used to polarize a digital work. For example, a stateless polarization 
employs a random number as a seed to transform a digital work into a polarized digital work. A state-based polarization 
scheme employs a seed based on a system state or characteristic of a system to transform a digital work into a polarized 

55 digital work that is associated with that system state or characteristic. A dynamic state-based polarization scheme 
employs a seed based on a dynamic system state or characteristic to transform a digital work into a polarized digital 
work. In this embodiment, the polarized digital work will typically be provided with a polarization engine for repolarizing 
the encoded digital work and the encoded system cont xt according to the dynamic stat -based polarization scheme 
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made. The polarization seed .s selected and both the digrtal wor* ana me sy polarization 

rrexn5-= 

fSr^e discontent o,a r 

clear, the replay application will be ab e to process ^ ^^JJ^^,*, tuBa £ for viewing or use by 
means a depolarizer must depolarize the presentation data into dear P~"™JSy when the replay application 
the user. if a portion of a digrtal work's resource informal • ^^^^SS^^L^ 
transformsthepoiarizedd"^ 

the polarized digital work into clear presentation da a. A I or just a ' £™ " 1 J unp0 , ari2e d digital content, 
polarized. The replay is blind in that the replay apphcat on d ^^^'™!2lon using a polarized system 
[0040] in this embodiment, a polarized digital work '^^^^^S^T^cLm or third 
conte*(resourceinformation)^^ 

protecting the content throughout use. eventually decrypt the digital work 

?0041] Unlike systems which employ encryption to protect he digital work ^"^^J keeps ^ digita | 
nto its dear form before the digita. work is "M"^ £™£Z^tti!2» moment 

worken^dedinthepola^ 

KSSU it ca'nnot be easi* <Jf at all) transformed back ^^ff^ and replayed in a blind 
Sd reTa^ystem of the invention by polarization of an appropriate resource information. 
Brief Description of the Drawings 

[0043] The structure and function of the invention is best understood with reference to the induded drawings, which 

may be described as follows: 

FIGURE 1 is a ,op-,eve, blocK diagram repressing a mode, (or the creation and commercial distribution of elec- 
tronic documents in either secure or insecure env.onmenU, ^ ^ ^ ^ 

*• d ^ aon * protected eiec,ronfc documente t0 a preferred 

fun e itb.L diagram ii.ustrat.ng the data structures present in a serr-protec«ng document ac- 
SiS^rc^Sn a user's perspective, illustrating the actions performed in handling and using a 
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self -protecting document according to the invention; 

FIGURE 8 is a graph illustrating several possible paths between an unrendered and encrypt d document, and 
rendered and decrypted presentation data; 

FIGURE 9 is a flow diagram illustrating a polarization process according to the invention in which document format 
5 information remains in the clear for rendering. 

FIGURE 10 is a block diagram of a method of format preserving encryption and trusted rendering according to the 
invention; 

FIGURE 11 is a simple example of a document to be tokenized; 
FIGURE 12 is the token dictionary for the document of Fig. 11; 
10 FIGURE 13 is the location table for the document of Fig. 11; 

FIGURE 14 is a block diagram illustrating a process for generating a polarized digital work and polarized system 
resource according to the invention; 

FIGURE 15 is a block diagram illustrating the conversion of a digital work into image data according to the art; 
FIGURE 16 is a block diagram illustrating a system for blind replay of a polarized digital work according to the 
15 invention; 

FIGURE 17 is a block diagram illustrating another system of blind replay of a polarized digital work according to 
the invention; 

FIGURE 18 is a block diagram of an example structure of a digital document; 
FIGURE 19 is an example digital document; 
20 FIGURE 20 is an example of the digital document of Fig. 16 after it has been polarized; 

FIGURE 21 is block diagram of an example structure of a resource information or system context for a digital 
document; 

FIGURE 22 is a block diagram of an example font table; and 

FIGURE 23 is block diagram of the font table of Fig. 22 after it has been polarized. 

25 

Detailed Description of the Preferred Embodiments 

[0044] The invention is described below, with reference to detailed illustrative embodiments. It will be apparentthat 
the invention can be embodied in a wide variety of forms, some of which may be quite different from those of the 
30 disclosed embodiments. Consequently, the specific structural and functional details disclosed herein are merely rep- 
resentative and do not limit the scope of the invention. 

[0045] Figure 1 represents a top-level functional model for a system for the electronic distribution of documents, 
which as defined above, may include correspondence, books, magazines, journals, newspapers, other papers, soft- 
ware, audio and video clips, and other multimedia presentations. 

35 [0046] An author (or publisher) 110 creates a documents original content 112 and passes it to a distributor 114 for 
distribution . Although it is contemplated that the author may also distribute documents directly, without involving another 
party as a distributor, the division of labor set forth in Figure 1 is more efficient, as it allows the author/publisher 1 1 0 to 
concentrate on content creation, and not the mechanical and mundane functions taken over by the distributor 114. 
Moreover, such a breakdown would allow the distributor 1 1 4 to realize economies of scale by associating with a number 

40 of authors and publishers (including the illustrated author/publisher 110). 

[0047] The distributor 1 1 4 then passes modified content 11 6 to a user 1 1 8. In a typical electronic distribution model, 
the modified content 1 1 6 represents an encrypted version of the original content 1 1 2; the distributor 1 1 4 encrypts th 
original content 112 with the user 118's public key, and modified content 116 is customized solely for the single user 
118. The user 1 1 8 is then able to use his private key to decrypt the modified content 1 1 6 and view the original content 

45 112. 

[0048] A payment 1 20 for the content 1 1 2 is passed from the user 1 1 8 to the distributor 114 by way of a clearinghouse 
122. The clearinghouse 122 collects requests from the user 116 and from other users who wish to view a particular 
oix tmuwv.. ~i nc- citannoni us <- " d c-isi- colien? r.cyiv.eni inionv,atier 1; s uci . a;' c^bi: iiar,:: action;: . crtciii care trans ac • 
tions : or other known electronic payment schemes, and forwards the collected users' payments as a payment batch 

so 1 24 to the distributor 1 1 4. Of course, it is expected that the clearinghouse 1 22 will retain a share of the user's payment 
120. In turn, the distributor 114 retains a portion of the payment batch 124 and forwards a payment 126 (including 
royalties) to the author and publisher 110. In one embodiment of this scheme, the distributor 114 awaits a bundle of 
user requests for a single document before sending anything out. When this is done, a single document with modified 
content 11 6 can be generated for decryption by all of the requesting users. This technique is well-known in the art. 

55 [0049] In the meantime, each time the user 118 requests (or uses) a document, an accounting message 128 is sent 
to an audit server 1 30. The audit server 1 30 ensures that each request by the user 1 1 8 matches with a document sent 
by the distributor 114; accounting information 131 is received by the audit server 130 directly from the distributor 114. 
Any inconsistencies are transmitted via a report 132 to the clearinghouse 122, which can then adjust the payment 
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in charges that vary, depending on the duration or other extent o I us . ^ ^ 

document distnbution are shown. As ■^^°f™r_ d oubljclv and deciphered privately by authorized 

,0052) At M «M. •> ««* **«™« 2 J° « -T^*™ ^DAMWMMk So fee* a the 

document type. . . &rr>e .... frhic Thfi c i ear content 216 can be copied, 

10054] As dtecussed above, the document is ^^.^'^^^^'Z or the author/pub.teher 
stored or passed along to other users without **. M **°°"™T? S £ £££ document in the clear in 
,10. Even a legate user may ^"If ^ owners. As discussed 

encrypted documents at the user 118's system. A s.mple embodiment ^ *J™"^ to a d 9 tion step 312 
[0056] Figure 3 looks similar to Figure 2, in that an erupted ^"^^.^^3% ^ever, an addi- 
whJ us J a private key 314) and a renderiy P.^ 

tional layer of protection ^P«^^r^^^^JS^M*2^ of Figure 2) avalabfeto be 

paste operations) according to the user's permissions. intermediate ■polarization" step 

[0057] Figure4isamorescphisticatedvers,on.^ 
adapted to secure the document after ^^^^ 

the user's system. . trancfnnn the document to a version having polarized 

trom m. u«« »y«.m» sate, su» a. Jh. «JJ. "» ™? M-ved Bom „, ^, system. 

process sp«d.,,d,enaimmb.r^ 

cation 424. As discussed above, typical rendering ^^^.^^S^n,-^ not be able to process the 
Adobe Acrobat Reader. However, it is likely that such external (end ^^uT^^ renderer will have been 
polarized contents 420, as the contents, any formatung codes, and other cues used by 

scrambled in the polarization process. fault-tolerant), or it must receive po- 

[0061] Hence, the rendering applcatton 424 must ^^T^^^^^tJ^m,^ be dtecussed 
larized contents 420 that are largely complete and processable by the appl.cat.on. 

below, in connection with Figure 9. „,„„„, a ti n n riata 426 which has be n formatt d by the 

[0062] The output of the rend ring application is polarized presentation data 426, wn.cn 
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rendering application 424 but is still polarized, and hence not readable by the user. The polarized presentation data 
426 is passed to a depolarizer 428, which receives the polarization key 418 and r stores the original form of th 
document as pres ntation data 430. In one embodiment of the invention, the depolarization function is combin d with 
the rendering or display function. In this case, the polarized presentation data 426 is received directly by a display 

5 device, which can be separate from the user's system and receive data over a communications channel. 

[0063] Creation of the polarization key 418, the rendering application 418, and the depolarization step 428 are all 
elements of the protecting shell 422; these are tamperresistant program elements. It is contemplated that all compu- 
tational (or transformation) steps that occur within the protecting shell 422 will use local data only and will not stor 
temporary data to any globally accessible storage medium or memory area; only the explicit results will be exported 

w from the protecting shell 422. This approach will prevent users from easily modifying operating system entry points or 
scavenging system resources so as to intercept and utilize intermediate data. 

[0064J It should be noted that the presentation data 430 of Figure 4, in alternative embodiments of the invention, can 
be either device independent or device dependent. In the device-independent case, additional processing by a device 
driver (such as a display driver or a printer driver) typically is necessary to complete the rendering process. In the 
is presently preferred device-dependent case, the device-specific modifications to the presentation data have already 
been made (either in the rendering application 424 or the depolarizing step 428), and the presentation data 430 can 
be sent directly to the desired output device. 

[0065] The decryption schemes described with reference to Figures 3 and 4 above are enabled by a unique document 
structure, which is shown in detail in Figure 5. As discussed above, certain operations performed by the system and 
20 method of the invention require trusted components. One way to ensure that certain unmodified code is being used to 
perform the trusted aspects of the invention is to provide the code along with the documents. The various components 
of a self-protecting document according to the invention are illustrated in Figure 5. 

[0066] The problem of document protection is approached by the invention without any assumptions on the presence 
of trusted hardware units or software modules in the user's system. This is accomplished by enhancing a document 

25 to be an active meta-document object. Content owners (i.e., authors or publishers) attach rights to a document that 
specify the types of uses, the necessary authorizations and the associated fees, and a software module that enforces 
the permissions granted to the user. This combination of the document, the associated rights, and the attached software 
modules that enforce the rights is the self-protecting document ("SPD") of the invention. A self-protecting document 
prevents the unauthorized and uncontrolled use and distribution of the document, thereby protecting the rights of the 

30 content owners. 

[0067] The self-protecting document 510 includes three major functional segments: an executable code segment 
512 contains certain portions of executable code necessary to enable the user to use the encrypted document; a rights 
and permissions segment 514 contains data structures representative of the various levels of access that are to be 
permitted to various users; and a content segment 516 includes the encrypted content 116 (Figure 1) sought to be 
35 viewed by the user. 

[0068] In a preferred embodiment of the invention, the content segment 51 6 of the SPD 510 includes three subsec- 
tions: document meta-information 51 8 (including but not limited to the document's title, format, and revision date), rights 
label information 520 (such as a copyright notice attached to the text, as well as rights and permissions information), 
and the protected content 520 (the encrypted document itsetf). 

40 [0069] In one embodiment of the invention, the rights and permissions segment 514 includes information on each 
authorized user's specific rights. A list of terms and conditions may be attached to each usage right. For example, user 
John Doe may be given the right to view a particular document and to print it twice, at a cost of $1 0. In this case, th 
rights and permissions segment 51 4 identifies John Doe, associates two rights with him (a viewing right and a printing 
right), and specifies terms and conditions including the price ($10) and a limitation on printing (twice). The rights and 

45 permissions segment 514 may also include information on other users. 

[0070] In an alternative embodiment, the rights and permissions segment 514 includes only a link to external infor- 
mation specitying rights information . In such a case ; the actual rights and permissions are stored elsewhere., for example 
on i. heiwoi>:ec permit sit i.teivtt, wi.ich iv.usi re- c,uiriitc ttirn lime the cuCl:;vkwm is. !=: ; usec.l his approach ptovioej 
the advantage that rights and permissions may be updated dynamically by the content owners. For example, the price 

so for a view may be increased, or a user's rights may be terminated if unauthorized use has been detected. 

[0071] In either scenario, the rights and permissions segment 514 is cryptographically signed (by methods known 
in the art) to prevent tampering with the specified rights and permissions; it may also be encrypted to prevent the user 
from directly viewing the rights and permissions of himself and others. 

[0072] The executable code segment 512, also called the "SPD Control; also contains several subsections, each 
55 of which comprises a software module at least partially within the executable code segment. In one embodiment of 
the invention, the Java programming language is used for the SPD Control; however, it is contemplated that any plat- 
form-independent or platform-specific language, either interpreted or compiled, can be used in an implementation of 
this invention. 
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[0 o 73] AH g -nforcer524ispre^ 

and polarize the data according to the system state (o P™^^, before rt te st0 red or decked, so 
embodiment of the invention, the polarization ^^^rS^^^ «» is secured, that is. it b 
the document is never stored in the clear on th %f t 2 D ^ e ve^ngineering, and disassembling. 

[0075] A counterpart depolanzation engme 528 » also ^luaeo * objects roviding 

romtUolarizedcontent(seeFigu^^^^ 
areiativerytamper-proofinterfac^totherendenngAP^ 

window objects are resistant to be,ng ^^^l^^^Z opting system, 
can be reconstructed by intercepting and generation of clear presentation data 

[0076] A counterpart depolanzation ^ B ^£2^Sf^*t a relative* tamper-proof interface 
,rom the polarized content (see Rgure 4) . The de J^J^^^ £ input t0 the depolarization engine 528 is 

Kre Son which depends on for ^^^^^^ 

sUm^^ 

SST Final., a raring engine 532 is included or <^^^^ 
deringLgineLneednotbese^ 

2079] Tneforegoingaspecfcandeien™^ 

?s^r?rr;tS P ^ 

specification 614; and an optional watermark ; 616. by the author or publisner . 

[0081] The content 612 is pre-processed (step 61 8) to lay out tn« > a essentially "pre-ren- 

For example, a preferred page size. font. Mlpv^ ^£,2521 compatible w*h users' systems and the 
dered" in the content pre-process.ng step so that rt w I be .n a ^ be ( .. PDP) format 

SPD. For example, the content 612 may be conv ^ ed / h r ^ (F ' ur e 5). In one embodiment of the 

to a different format special* adapt,* to e ^ "P - ^ ** 

invention, multiple vers.ons of the "^J^^^^JL purC hased by the user according to his needs, 
generic SPD 610; those ^J^^SS,SS2S-I« of access rights are perm^sible. Such a 
[0082] The high-level nghts specrttcat on 61 4 sett fort ' deS c rib ing different groups of rightsfor drtferent 
rights specification istailoredtoaparuculardocum^ 

classes of downstream users. For examp £ $2 00 roya«y. S.mi,ar.y, users may be g.en the 
document M^^^^^^X one month, one year, or never. Severa. poss.ble 
opl ion to purchase a vers.on of the docume it the " below 

timitations are described with Terence to ad eta led ^ . ^ . gM£ |or digila| works . , 

(OOLSi Digital property hipr.if -r.cuage (DFhL, u d and en)orced , or rights. Rights spec- 

provides a mechanism in which ™^«™^-™^2£ Z ^r example. U.S. Patent No. 5,715,403 to Stefik, 
Ltions are represented as statements ,n f^"**™ ZSZ^a Attached Usage Rights Where the 

l sp Jcation, different sets of rights applicable to »^£225^ o condftJ, Conditions can be of 
cal ,ed "rights groups". Each right wtthm a ^^^Tlt^. typ of devfc on whteh the operation 
different types: fee to be paid, t.me of use, typ of access, type 
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can be performed, and so on. DPRL allows different categories of rights: transfer, render rights, derivative work rights, 
file management rights and configuration rights. Transport rights govern the movement of a work from one repository 
to another. Render rights govern the printing and display of a work, or more generally, the transmission of a work 
through a transducer to an external medium (this includ s the "export'* right, which can be used to make copies in the 
5 clear). Derivative work rights govern the reuse of a work in creating new works. File management rights govern making 
and restoring backup copies. Finally, configuration rights refer to the installation of software in repositories. 
An exemplary work specification in DPRL is set forth below: 

to (Work: 

(Rights-Language- Version: 1.02) 
(Work-ID: "ISDN-1-55860-166-X; AAP-2348957tuT) 
15 (Description: 'Title: 'Zuke-Zack, the Moby Dog Story' 

Author: 4 John Beagle* 
Copyright 1994 Jones Publishing") 

20 

(Owner (Certificate: 

(Authority: "library of Congress") 

(ID: "Murphy Publishers"))) 
(Parts: "Photo-Celebshots-Dogs-23487gfj" "Dog-Breeds-Chart-AKC") 
(Comment- "Rights edited by Pete Jones, June 1996.") 
30 (Contents: (From: 1) (To: 16636)) 

(Rights-Group: "Regular" 

(Comment: "This set of rights is used for standard retail editions.* 9 ) 
35 (Bundle: 

(Time: (Until: 1998/01/01 0:01)) 

(Fee: (To: "Jones-PBLSH.18546789**XHouse: "Visa"))) 

40 (Play: 

(Fee: (Metered: (Rate: 1.00 USD) (Pen 1:0:0) (By: 0:0:1)))) 

(Print: 

(Fee: (Per-Use: 10.00 USD)) 

(Trinifc: 

(Certificate: 

50 

(Authority: "DPT" 

(Type: 4 TnistedPrinter-6 f *))) 

55 (Watermark: 
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(Watermark-Str Titl : 'Zeke Zack - the Moby Dog' Copyright 
1994 by Zeke Jones. All Rights Reserved.**) 
(Watermark-Tokens: user-id institution-location render-name 

render-time)))) 

(Transfer. ) 

(Copy: (Fee: (Per-Use: 10.00 USD))) 

(Copy: (Access: 

(User. (Certificate: 

(Authority. "Murphy Publishers'*) 

(Type: "Distributor"))))) 

(Delete:) 
(Backup:) 

(Restore: (Fee: (Per-Use: 5.00 USD))))) 

ofa book titled "Zuke-Zack, the Moby Dog Story. ™ e J°*^^^ 

print, transfer, copy, delete, backup, and restore. The wort irr «"£JJ* se , of conditions that 

a chart of breeds incorporated from other sources^ J^jff^ ™ up are valid un ti. January 1 . 1 90B«J 
apply to all rights in the group. This SP^' 0 " 8 ^^;^ Searing house for this transaction should be 
that the fee should be paid to account tJm ^^!^^Too every hour, where fee is accumulated by 
Visa. The following contract applies:the work \^^^^^^^to r »M$^M^pn^^ 
the second; the work can be printed on TrustedPnnter-6 wh^ eh • mjomo , y inf0 rmation known 

printed copy should have a watermark string r*^^£i£S by ac^ulg a distributor certificate from 
at the time it is printed; this work can be cop.ede.the ' *PW * 0* «^ ^Jed (restoration costs $5.00). 
Murphy publishing; and unrestricted transfer J^^^p (step 620). in which the , hj- 

^egenericSPOe^ 

Used rights specification 614, ^"X^^ZZlS 610 may also optionally be encrypted by the 
mav be either visible or concealed within the SPD. me generic or 

author/publisher 110 for transmission to * e «*™™ZZV™ md „ stored for later customization. When a 
C00881 The qeneric SPD 610 is then received by the distributor c , earinqh0 use 122 or other interme- 

6 9 24 is receded by ,he distributor «^-^^^ b 2TS, both the user reaves, 624 
diary , me d -ib Ul o ;= s a se, o user P^^^^. lhen n0 lurther action is performed 

and the riqhts specification 61 a. ii mere iw nu t r-nc^' 

c r. ma. user, behaH (erne: mar. ' used K generate (step 630) a customized SPD 

,00691 The user permissions and tne user's public key 6^6 ere men. se 9 ^ jssions 

^ adapted to b/used by the user. ^ ^^^^J^^Lt in ^content segment 516 
segment 514 of the SPD 632. and the user's public key 528 » "*J the SPD from the generic form to the 

of ?he SPD 632. A public-key encryption mec ^^t^^o^^ Mtred T! 



can be used to decrypt it. 
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[0090] The resulting custom SPD 632 is then transmitted to the user 118 by any available means, such as via a 
computer network or stored on a physical medium (such as a magnetic or optical disk). 

[0091] The operations performed when a user rec ives an SPD are depicted in th flow diagram of Figure 7. The 
SPD is first received and stored at the user's system (step 710); in many cases, it is not necessary to us th SPD 

5 right away. When usage is desired, the user is first authenticated (step 712), typically with a user name and a password 
or key. The system then determines what action is desired by the user (step 71 4). When an action is chosen, the rights- 
enforcement step of the invention (step 716) verifies the conditions associated with the desired action (such as th 
fee, time, level of access, watermark, or other conditions); this can be performed locally via the SPD applet 51 2 (Figur 
5) or by accessing a rights enforcement server. 

10 [0092] If the rights enforcement step (step 71 6) fails, an update procedure (step 71 8) is undertaken. The user may 
choose to update his permissions, for example by authorizing additional fees. After the satisfactory verification of con- 
ditions, a pre-audit procedure (step 718) is performed, in which the SPD system logs verification status to a tracking 
service (e.g., the audit server 130 of Figure 1). The content is then securely rendered to the screen (step 722) as 
discussed above. When the user is finished, a post-audit procedure (step 724) is performed in which the amount of 

'5 usage is updated with the tracking service. The SPD system then awaits further action. 

[0093] The protection yielded by the SPD is derived from the user's inability to capture a useful form of the document 
at any intermediate stage during the rendering process. This is accomplished by decrypting the document contents to 
a clear form at the latest possible stage, ideally in the last step. 

[0094] The SPD decryption model is illustrated in Figure 8. E denotes the encryption function performed by the 
20 publisher, D denotes the decryption performed at the user's system, and R denotes the rendering transformation. Many 
prior systems use a first sequence of transformations 810, D(E(x)) followed by R(D(E(x))). As stated previously, the 
early decryption leaves the document in a vulnerable state. Ideally, the transformations are performed in the reverse 
order 812, R'(E(x)) followed by D(R*(E(x))). This postpones decryption to the latest possible time. 
[0095] The existence of R\ a rendering operation that can be performed before decryption, is determined by the 
25 following equality: 

D(R'(E(x)))=R(D(E(x))) 

so in case that the encryption and decryption functions are commutative, that is, E(D(x)) = D(E(x)) for any x, the existence 
of R' is ensured: 

R' (y) = E(R(D(y))) fory = E(x) 

35 

In practice, encryption and decryption functions in popular public-key cryptographic systems such as the RSA system 
and EIGamal discrete logarithm system satisfy the commutation requirement. This means that the transformation R' 
exists if these cryptographic systems are used for encryption and decryption. 

[0096] The path x* = D(R'(E(x))) portrays an ideal SPD solution to the document protection against unauthorized 
40 document usage and distribution. A scenario of distributing and using a document can be described as follows. When 
a user purchases the document, the document is encrypted using a user's public information and is transmitted over 
an insecure network channel such as the Internet. The encrypted document has the rights information attached to it 
and a protecting applet 512 that enforces the rights and permissions granted to the user by the content owner. Upon 
a user's request on using the document, the applet verifies the rights and permissions and generates from the encrypted 
45 document the presentation format of the original document. As any intermediate form of the document before the final 
presentation data is encrypted with the user's private information, the SPD model of document protection ensures that 
any intermediate form of the document is. not useful to other svstems wherever it is intercepted. 
1 0097 j Clearly, this ia&£.i iy.c-ol-. rents on whethei oi no: nu- iranf tonriciion R' tnai corresponds to the rendering 
transformation R can be computed efficiently, and in particular on whether or not an invocation of the decryption function 
so 0 is necessary during an implementation of R\ A trivial case in which R' can be implemented efficiently is where R is 
commutative with the encryption function E. When this happens, 

R'(y) - E(R(D(y))) = R(E(D(y))) = R(y) 

55 

for y = E(x). In this case, R' = R. 
[0098] Consideration of Figure 8 reveals that many intermedial solutions (e.g., intermediate solutions 814, 816, 
and 818) to the document protection problem may exist on the user's system between the two extremes x' = R(D(E 
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20 



«, e , « j -j rvMEteW which has ideal protection (under the assumptions 

level to the document. . decry p ti on to the last possible moment employs a 

(00991 As discussed above, one alternative method of delaying aec yv d0C ument as a whole. 

SSLon technique that encrypts on* the docuj-J £™^c^0 (which, it should be noted, does 
This possibility is shown in Figure 9. Beginning wrth the cle r *«n ^ ^ 

not exist in any single identifiable ^SZ^™u***^P^™™*»™ 

steo 41 2 of Figure 4). the document is split (step 912) into > a oa» po fo(mat rtK)n 91 6 . 

(stip 918) "sing «he po^n ^ nd me^ed Jjpfjg^ ^ ^ ^ 
This results in polarized content 924 tha can be secure than wholesale encryption wjth the 

content, it should be observed that Iln, the layout of a document, word iengths, line 

polarization key, since a lot of information can De ™ ^ copyngllt inf nn gement. 

fengths, etc.; however, this scheme w.ll ^*^£Z^„^ a blind transformation function . shown 
10100] A method of protecting a digital work ^^JJ^ 0 is provided to replay application 1012. Digital 
U reference* Figure 10. In Figure '^SSSS^^^ enab,eS ^ " o ° 8 

work 1010 has been encrypted w.th a ^^^^Jm data 1016 is then sent to decrypts engine 1018 
generate encrypted presentaUon data 1016. E^^f^Son data is now in the clear, but less likely to be 
where it is decrypted into cl^ 

to display device 1 026. transformation can be stated as follows. Suppose a client Cathy 

[0101] in a general context, the V 0 ^*^™*^^ n is (public or private) data a and her onvate data 
Lents a server Steve to compute for her a «^*££2£J?h & wKhout Steve knowing her private date > x 
x and Cathy wishes, for privacy concerns that the ^ he cornputes F(a ,x) for Cathy but with his 

and the function value F(a,x). From Steve's point °'^.^ S X sieve to perform the transformation only wrth 
eves blindfolded. What this means is that Cathy would " «J£ ° Ek(F ( a ,x)) again encrypted using her key 
Zt Z) encrypted using Cathy's key k, ^^ff^^ S^ hf avoided 9 disclosing the data x in the 



30 



35 



40 



45 



F(a,x)<- irr -FXa,E(x)) 



55 



[0102] A protocol for blind transformation can be aescnoea 

(0 Cathy encrypts x using her encryption Key k, resulting 6,1*). 

S:e2^ -Lo « K o, — «■ at tne . - a ano encrypteo oat, fc*(x). 

^ „ h« r0 r a n be reaarded as a generalization of blind signatures 
[0103] Theidealmodelofblindtransformationntmdu^ 
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that the server computes private to the client, but they differ in that the client supplies the data input and the server 
supplies (a program that evaluates) the function in blind transformation, while it is the oth r way around in secur 
mobile computing. Note that blind transformation allows som portion of the data (e.g., a) to be in clear. This enables 
us of some dynamic yet clear data in the rendering proc ss, such as display window size, r ference positions for 

5 shifting content, scaling factor and coefficients in a rotation operation. 

[0104] Blind transformation works only if there exist functions F and P to compute the encrypted data. It can .b 
shown that multivariate: integer coefficient affine functions using additive encryption schemes permit many document 
rendering functions of the affine type on the x- and y-coordinates to be evaluated in blind transformation. For a given 
encryption scheme S, a function F: X -> X is said to be S-biindly computable if there exists some function P : X -» X 

10 such that the computational complexity for evaluating F is a polynomial of the one for evaluating F, and 

F(a,x) = D k " 1 (P(a,E k (x))) 

is for any k G K and xGX.A function F: X X is said to be blindly computable if there exists an encryption schem S 
with X being a subset of its message space such that F is S-blindly computable. 

[0105] Any multivariate, integer-coefficient affine function is S-blindly computable for any additive encryption scheme. 
Specifically, let 



25 be a multivariate affine function with a constant XqGX, integer coefficients a, and variables x v ... x k in X. Then, for 
any key k G K, there exists a computationally efficient function 



such that 

40 Indeed, the constant y 0 and integer coefficients b, in F Wtbl ^ can be taken to be y 0 = E k (Xo), bj = a jt i= 1 , k. The 

blind transformation of multivariate, integer coefficient affine functions using additive encryption schemes allows many 
document rendering functions of the affine type on the x-and y^oordinates to be evaluated in the blind manner, pro- 
viding a theoretical foundation for the format-preserving encryption and trusted rendering of documents described 
herein. 

45 [0106] A document is usually a message that conforms to a certain format. For document encryption : in addition to 

simply encrypting the entire document, there are many different ways to encrypt only some parts of the document. The 
aoal here is that the information leakage about the unencrypted portion cannot be used, or if it does leak : it is compu- 

WMici'ir.ih ciiftiCLfli u, reccnj sum; , iht- c\ez... eriunai ccmmer.. 

[0107] If an encryption scheme which preserves formatting information of the digital work, then any transformation 
so function (replay application or rendering application) may be used. An example of a format preserving encryption meth- 
od is described for convenience with reference to token-based documents. The method for format-preserving encryp- 
tion can be easily extended or applied to documents in other formats (such as HTML/XML, Microsoft WORD, Acrobat 
PDF, etc.). In a token-based format such as the Xerox DigiPaper, each page image of a document is represented as 
a ■dictionary'' of token images (such as characters and graphics elements) and location information (indicating where 
55 those token images appear in the page). Thus, multiple occurrences of the same token in the document can be rep- 
resented using just a single image of that token in the dictionary. 

[01 08] The process of rendering a document in such a format is then accomplished by consecutively reading in token 
locations, retrieving images of the tokens from the dictionary and drawing the images at the specified locations. The 
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bene^oftoken^eddocumen^^^^ 

T of size ITl. together with a sequence of Ptables rt ^ tokM . Each ^ L«. 1 s 

Each entry TIj], 1 i\S m. is a pair («dU,tQ] ? ' * ■"^'jfjg ^ he k-ih token occurrence in the i-lh 
It s IL* in the i-th image locat.on table L, » a ^^^1^^ y^oordinate differences from the previous 
page image, where id[k]st« 

(k-1)-th token occurrence in the page. For example, taKe m ° 8 "^ _ 12 nd 13 respectively. 

LLtion«ab.e(usingx. y coordin^ 

[0110] The schematic pseudo-code Render(D) below snows now - pag » Lo okup(T,id[kl) is a subroutine 
!ne code, xo, y 0 are the base references ^^^^^^ < in T corresponding to the 



so 



Rendei(D) 
{ 

Load T into memory 
for i = 1 to P do 
{ 

Load Li into memory 

x = Xo 

y=yo 

f or k = 1 to LI do 

{ 

x = x + x[k] 
t = Lookup(T4dM) 

40 

Draw(x,y,t) 

} 

} 

} 
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t - ■ * 4. - v = v + b as used in the schematic rendering process 
[0113] Rotation. The rotation transformation is 
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5 



for some constants a, b, c, d, which form a 2-by-2 rotation matrix. This transformation is needed when the page imag 
is rotated. 

[01 1 4] Affine Transformation. An affine transformation is one of the form x = ax + by + e; y = cx + dy + f for some 
constants a, b, c, d, e, f. In the vector form, it is: 



Clearly, shifting, scaling and rotation transformations are special cases of affine transformations. It is those affine typ 
transformations that make it possible to achieve a high-level trusted rendering under encryption of coordinate infor- 
mation using additive encryption schemes described below. 
20 [0115] A special class of encryption schemes, namely, additive encryption schemes, are used to carry out blind 
transformation of functions of the affine type, which provides a foundation for trusted rendering of documents. Blind 
transformation by a rendering transformation R and R' of an encrypted document satisfies the relationship: D<R'(E(x))) 
= R(D(E(x))), where E is an encryption function and D is a decryption function for E. If E(x) is an additive encryption 
scheme, then R' = R. 

25 [0116] An encryption scheme S generally consists of basically five components: (i) a message space X which is a 
collection of possible messages, (ii) a ciphertext space Y which is a collection of possible encrypted messages, (iii) a 
key space K which is a set of possible keys, (iv) a computationally efficient encryption function E : Kx Vand (v) 
a computationally efficient decryption function D : K x V-» X . For each key k G K, there is a unique key lc 1 .G K, such 
that the encryption function E k = E(k t ) : X -* Y and decryption function D* 1 = D(kr\) : V-> X satisfy that, for every 

30 message x G X, D* 1 (E k (x)) = x. The key k is cailed an encryption key and k" 1 its corresponding decryption key. 

[0117] Such defined encryption schemes can be varied in several ways to cover a wide range of concrete encryption 
schemes used in practice. One variation is to consider whether or not keys used for encryption and decryption are 
different In the case where all encryption keys k are same as their corresponding decryption keys le 1 , the scheme is 
a symmetric (or private-key) one; otherwise, the scheme is asymmetric. In the case where, for all possible k, k" 1 is 

35 different from k and computationally difficult to derive from k, the scheme is a public-key encryption scheme. 

[0118] Another variation is to differentiate deterministic and probabilistic encryption schemes. In a deterministic 
scheme, all the encryption and decryption functions E k and D*.-, are deterministic functions, while in a probabilistic 
scheme the encryption function E k can be non-deterministic, namely, applying the function to a message twice may 
result in two different encrypted messages. 

40 [0119] An additive encryption scheme is an encryption scheme whose message space X and ciphertext space Y 
possess some additive structures and encryption function E k =E{k,) :X-> Vis homomorphic with respect to the additive 
structures. Specifically, let X = (X, +, 0) and Y = (Y,©,0) be two commutative semigroups with (possibly different) zero 
elements 0 satisfying, for example, for all x, x + 0 = x and 0 + x = x, and efficient operations + and ©. An encryption 
scheme is said to be additive if, for any k G K and any x, x" G X, E k (x + x') = E k (x) © E k (x'), and the operation © does 

45 not reveal the clear messages x and x\ The last condition on © makes additive encryption schemes non-trivial. Without 
this condition, the operation @ on Y can be trivially defined y © y' = E k (D M (y) + D^y')); that is, it is accomplished by 
first decrypting the araument?. then adding them together and finally re-encryplinc the result. 

[uiiu] Closely nMai'ec u- txc wwe i-r.ciypiion scnemei iv^liir hc£iive viut: . .»n Encryption scheme is said tc be 
multiplicative if its spaces X and Y have the ring structures (i.e., in addition to their additive structures, they have 
so respective multiplications x and ® that are distributive over their additions + and ©, and multiplicative identities), the 
encryption function E k is homomorphic with respect to the multiplications, E k (x x x*) = E k (x) ® E k (>0; and the operation 
® does not reveal the clear messages x and x\ 

[0121] In general, additive (as well as multiplicative) encryption schemes are not non-malleable, since a non-malle- 
able scheme requires that, given an encrypted message it is (at least computationally) impossible to generate a different 
55 encrypted message so that the respective clear messages are related. Accordingly, they have a weakness against 
active attacks where the adversary attempts to delete, add or alter in some other way the encrypted messages. How- 
ever, when these schemes are used to encrypt documents, extra measures in data integrity and message authentication 
can be taken to reduc risks caused by these active attacks on document integrity as well as confidentiality. Moreover, 
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end users are less mooted to MU. active attacks, as the attacks wi.. affect document contents that the users are 

going to use and consume. d nat(jra| manne r. In fact, some 

[0122] Not all encryption schemes can be defined as additiv or i s _i i an y ^ 
Encryption schemes are designed with a requirement in the method of 

addiSe. Nevertheless, there are many ^examples of ^^SS^Si SeTdeterministic schemes), OU 

may be used in the format preserving method. e/ * amo wh pro X - Y = Zl = (0, 1 , ... n-1} for some 

[01231 Multiplicative Cipher (Mult) is a symmetnc encrypt.on scheme, where X - Y - ^ <u, 
integer n > 0. The encryption of a message x using a key a is 



E a (x) = ax(mod n) 



75 and the decryption of a message y using a key a is 

x = D a (y) = a" 1 y(mod n), 



20 where a-1 is the multiplicative inverse of a modulo n. cphertext space Y = Zp for some 

function is defined as the exponential function 
25 E g (x) = g x (mod p), 

while the decryption function is defined as the logarithm function 
30 D g (y) = log g y(mod(p-1)). 

[0125] Semi-probabifctic E.Gamal Cipher (EG) extends the -J"""*^? ^e^MW^S 22 
K EIGama. /pher to run in a semi-probabilistic ^^ Q ^ 0 ^ 0 TZi a random number a e 

numberrGZ^: 

E a (x,r) = (g r (mod p), xa r (mod p)) = (s,t). 
For an encrypted message (s, t), the decryption function is defined as 

D u (s,t) = t(sV (modp). 

E a (x, r) @ E a (x', r) = (s, t) © (s, f) = (s, t + f) = E a (x + x'(mod p), r). 
[0127J This partial* defined ope^ 

numberr „ u ™, amn tn and Uchivama proposed an additive, public-key encryption 

55 [0128] Okamoto-Uchyama Cipher ^^^^^^^ as Secure as Factoring". EurocrypV98. 
scheme in T. Okamoto and S. Uch,yama. , New Pubhc ^ ^osyst ^ ^ ^ 
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n = p2q. Choose g G Z* n at random such that the order of g p = gP- 1 (mod p 2 ) is p. Let h = g n (mod n). The message 
space X of the OU scheme is the set Z* p (notthe set {1 ,...2 k - 1 } as claimed by Okamoto and Uchiyama) and the ciphertext 
space Y is Z^. For a user, a public key is a tuple (n, g, h, k) and its corresponding private k y is the pair (p, q) of the 
primes. To encrypt a message x e X, a random number r £ 7^ is chosen uniformly. Then the encrypted m ssage is 

5 

y = E (nghk) (x I r) = g ,t h r (modn), 
To decrypt the encrypted message y, a "logarithmic'' function L:T -» r, 

10 

L(x) = (x-1)p" 1 (modp 2 ) 

is used, where r is the p-Sylow subgroup of Z* p2 , i.e., r = {x e Z*^ I x = 1 (mod p)}. With the function L, the decryption 
is function is 

x = D pq (y) = UyP" 1 (mod p 2 ))L(g p )* 1 (mod p 2 ). 

20 [0129] New additive encryption schemes can be constructed from existing ones via the composition construction of 
encryption schemes. The composition construction can also be used to construct additive encryption schemes from 
non-additive ones. For instance, the composition of the exponential cipher Exp and any multiplicative encryption 
scheme S (such as RSA) results in an additive one. 

[0130] Additive encryption schemes enable blind transformation with partially encrypted data, which serves a foun- 
ds dation for trusted rendering of documents, as discussed above. In particular, additive encryption schemes can be used 
to perform blind transformation of affine functions with clear coefficients and encrypted variables. 
[01 31 ] Returning to the example of a token-based document, since a token-based document D consists of a dictionary 
T of token images and a sequence of location tables L } (one for each page image), the idea is to encrypt the content 
of the dictionary T and location tables L |t resulting in a dictionary T of encrypted token images and tables L'j of encrypted 
30 locations. Recall that the dictionary T consists of a collection of pairs (id[j], fljl), j = 1 , ... IT1. Associated with T is a 
subroutine Lookup in the rendering process that, given a valid token identifier id ; returns its corresponding token imag 
t in T. In encrypting the dictionary T, there are three basic choices: encrypting token identifiers, token images, or both. 
Encrypting either identifiers or token images helps unlink the connection between the identifiers and their token images. 
In addition, encrypting token images protects proprietary token images. In any case, it is desirable to allow valid access 
35 to the dictionary only within the rendering process P, while making it computationally difficult to obtain a copy of the 
entire, clear contents of the dictionary. This is possible because in many cases the valid identifiers (e.g., Huffman 
codewords) are only a very small subset of all binary strings of up to a certain length, and consequently any exhaustive 
identifier search will not be efficient. 

[01 32] More formally, given the dictionary T and the Lookup subroutine that accesses it, the requirement on encrypting 
40 the dictionary is that the encrypted dictionary T and the corresponding subroutine Lookup' satisfy the following con- 
straints: 

(1) For any encrypted identifier E k (id) : Lookup'(T,E k (id)) = E k (Lookup(T,id)) and 

(2) Given T and Lookup', it is computationally infeasible to reconstruct T. 

45 

[01 33] For an encryption scheme S : T and Lookup' can be constructed as follows. Let ID be the set ol all syntactically 
possible identifiers; in particular ID*c ID : where ID* = {id I (id : 0 s T). Let h be a one-way hash function whose domain 

i; !L . u<-.;. ihc ehcry;-;ec tc-fcer, c\c\i(;uzr : ' n c.^Mvec' in..-,. "= as icliov.*: : k i cvh; <ic : t ; pc.ii in h pfei: (hficTrX^O) u 
inserted into T\ The modified subroutine Lookup 1 uses the algorithm: 

50 
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Lookop'CTAd) 
{ 

id'*h(id) 

t' = Looknp(r4d') 

return (0 

} 



Nofcethatthe return va.ue of Lookup' is an encrypted -^^^JSffl^Sr - * 

~s^^^ 

T and Lookup'. identifier, and location difference in x- and y-coordinates, 

20 [0135] Since each entry in a location table L, consists ot an nra . jnfolTnationi an additive encryption 

any combination of the three e.ements can be enciypted To enc^pt *e ^^ ,n ^ ^ ion w(Mnates . 
scheme is recommended to enable applying any rendenngtrans^ 
For identifiers, a trade-off between document compre^ 

document, a token identifier is usually a codeword of some coding scheme ^or thee p ^ ^ ^ ^ 

when the Huffman code is used to compress a deterministic encryption 

tokens based on their occurrence frequency inthe dement. I h«, J.,^^ notchange 
schemetoencryptlhese identifier offers no °«^^ 

the occurrence frequency of each token, and ^^^ n .2^^ etBiore m orde r to hide occurrence fre- 
identfiers to reconstruct the Huffman codewords that a ^' d ^ 
quendesofthetokens in the document it ispreferred^ 

r^:rm d r e ^ 

modified to reflect the change. Hftrnment mentioned above, the document content 
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Rcndci<D) 

{ 
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fori= 1 toPdo 

{ 

5 

Load L< into memory 
x = Ek(xo) 

w y=Ek(yo) 

fork= 1 to KJ do 

{ 

15 x = x©x[k] 

y = yey[k] 
t = Lookup'CT4d[k]) 

20 

Draw'Cx.y.t) 
} 

} 

} 

Draw'(x,y,t) 
{ 

x = Du(x) 
y=Dk-i(y) 

35 

t = Dk-i(t) 
Draw(x,y,t) 

> 

During the process, all the coordinate and token image information remains encrypted before calling the subroutine 
Draw'(x,y,t). This is possible for the coordinate information because the encryption scheme is additive. Consequently, 
the content protection level and rendering process performance of the rendering process rely on the security strength 

« and computational complexity of the scheme used. 

[0138] In another embodiment of the invention, a digital work is polarized enabling trusted rendering or replay of the 
digital work without depolarization cf Ihe digital content or the presentation date. In this embodiment, the digital work 
is ihe type which induces cicsU:.. i c nieni anc resource mionv.c.uc ic:isl caliec l synem context). Resource informatior. 
includes formatting information or other information used by a replay or rendering application to convert the digital work 

so into presentation data. 

[01 39] Polarization is a type of transformation which renders the original content unreadable or unusable. For a digital 
work w, a polarization scheme T, which uses a seed s, generates a polarized digital work w' according to: w' = T(w, s). 
The same transformation T may also be used to generate the polarized resource information S' according to S' = T(S, 
s). In this example, a seed s is used to make reverse engineering of the polarization scheme more difficult. 
55 [0140J f or example, a document type digital work may be polarized using a simple polarization scheme. In a docu- 
ment, the digital content comprises a series of characters in a particular order or location. If the document is to be 
displayed on a viewing device, each character must be able to be displayed at a particular location for viewing by a 
user on the viewing device, such as on a monitor A coordinate system is required for displaying each character on 
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h ^hora^tar in the document can be displayed on the monitor. The digital content contains coordinate 
ization functions may be used to polarize the above paragraph: 



20 



25 



30 



35 



40 



Y = bv, for the vertical axis; and 
X = x/a, for the horizontal axis. 



£^2££Ln tunctions are used to compute both x and y .ocations o, a g.en pent: 



Y = logt(Y), for the vertical axis; and 
X = aX, tor the horizontal axis, 



where log* is the logarithm with base b. character in the polarized digital work, the location is 

[0143] When the replay application obtains the location of a character w . me po a 
givJby (X ,Y) = (x/a. bv). This value is then , applied «™ * ** 

the correct location of 'P is displayed on the user's monitor. In bo* i cases o _ po*™ , ^ P ^ 
resource information and the digital work main tarn ar ' dig ita. work. 

rh=& 

quality of the polarization seed may also be used m < ^^°^J^2Id(auch as one containing authorization 

context. nreferably provides information from the user system in which 

Ker=^^^ 

polarized digita. work and the polarized resource ,rt °^ n ^mlZZe pr^ to the user. 

The polarized digital work and polarized ^2Si5o^^ 

A.so,ty P ically,bu.no,neededforoper a t,on oft ^J^^^^' 1 ?^ m poianzed digita, work and 

encryption scheme usee. ^ ^ __ L . it fijvjcec |r(K . th{ -, 5iers - fieri are cene.auor, c 
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the digital content is polarized and the resource information is preserved, creating polarized digital work 1422. The 
content polarization 1420 may occur as shown with reference to Figure 9. A digital work typically includes cont nt, 
instructions and formatting. While polarization can occur to the entire digital work, preferably only the content is polar- 
ized; the instructions and formatting are not polarized. However, in some instances, for some replay applications, som 
5 of the resource information contained within the digital work may also be polarized. This is similar for the format pre- 
serving encryption method described above. 

[0149] Resource extraction 1412 extracts at least one resource information from the set of resource information 
associated with digital work 1410. Extraction consists of copying the resource information into a system resource file 
1414. System resource 1414 is then polarized at resource polarization 1416 to become polarized system resourc 
10 1424. The polarization scheme for content polarization and resource polarization need not be the same. Preferably, 
each polarization scheme employs a polarization seed 1418 which is generated by seed generator 1426. Several 
exemplary methods for seed generation are described below. In particular, in a preferred embodiment, the polarization 
seed is based on unique information from the user's system. 

[0150] Several techniques for generation of the polarization seed may be used. For example, a seed generator which 
is generates a number from a random number generator may be used. This method, referred to as stateless polarization, 
does not depend on any secret key information and user system information. The process for stateless polarization 
yields a specific value for the system for polarization. The inherent vulnerability for digital security systems may be 
found in mishandling secret information, mathematical complexity, and algorithmic complexity. Eliminating the secret 
information seals off one target of attack. With stateless polarization, a random number generator produces the polar- 
20 ization seed. In this case, once the polarization process is complete the seed is discarded without a trace. Hence, the 
security of the system is free from attack focused on compromising the secret information, and the user need not 
divulge sensitive information that may be deemed a privacy violation. 

[01 51] Another seed generator that may be used is a state-based generator. The state-based seed generator con- 
structs a seed by first acquiring system state information from the user's replay system or rendering device. System 

25 state information includes hardware identifiers, system settings and other system state-related information. While there 
is much value in stateless polarization, other security requirements may require use of an inseparable linkto a particular 
user system or device. By generating the polarization seed from system/device-specific information, the polarization 
engine will produce a digital work that is polarized to a form that corresponds to a specific system/device. 
[0152] The polarization seed generator can also be tied to an authorization process. In authorization-based polari- 

30 zation, the seed generation can be tie in with the outcome of the authorization process. A separate authorization 
repository (which is a trusted source) provide authorization information as part of some other security feature associated 
with delivering access to a digital work to a user. The trusted source of authorization information may be an online 
authorization repository as described in US Patent No. 5,629,980. This authorization information is then used to gen- 
erate a polarization seed. 

35 [0153] if a stateless polarization seed is used, the digital work and its resource information may be polarized and 
stored together for delivery to a user when a user purchases the associated rights of use forthe particular digital work. 
If one of the other polarization seed generation methods is used, polarization typically must wait until the user provides 
the system state or authorization information before the digital work and resource information may be polarized. 
[0154] An embodiment which provides a higher level of protection in terms of ensuring that the digital work may be 

40 replayed only on a specific physical system or device uses a dynamic state-based polarization seed. In this embodi- 
ment, a polarization engine and polarization seed generator must be provided to the replay application or rendering 
device along with the digital work and resource information. In this embodiment, the digital work and resource infor- 
mation are polarized prior to replay and rendering using a seed which is generated based on the dynamic state of the 
particular system or device. The dynamic state may come, for example, from the system dock, CPU utilization, hard 

45 drive allocation, cursor coordinates, etc. By polarizing the work using a snapshot of a dynamic state, the work is locked 
to a particular system configuration (i.e. : state) in time. Polarization of the digital work, and ultimately its blind replay 
(described below) : is based upon a dynamically evolving stale. The evolution of the dynamic state does not yield unique 
<eciei iniorinottor, tnsi c.Ik=v. ? rr-rp&e.iaoiiitv of int poioiUcuioi. r roc-eei . snc hence dynamic-state based polarization 
makes compromising the polarized digital work and system context more difficult. Since the polarization process is 

50 carried out within a trusted system, it is implied that the process can not be deconstructed. 

[0155] The actual process of polarization can be, as described in the example above, an algorithmic-based trans- 
formation -parameterized by the polarization seed. During polarization, the data and resource identifiers of the digital 
work are transformed as described above. The structure of the digital work is unaltered, however, such that the original 
format, such as PDF, DOC, WAV, or other format, is retained much like in the format preserving encryption. Similarly 

55 the polarization of the resource information yields a polarized form of the resource information such that the resource 
identifiers, element identifiers and resource characteristics are transformed, yet the structure of the system context 
remains unaltered. By polarizing the digital work and resource information according to the same seed based on a 
user's specific device or system information, an inseparable relationship is established such that the work cannot be 
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rep , ay edto«sc.earfor^ 

remains in effect. ^harartPristics of the polarized r source information enabl the replay appli- 

[0156] ^nng b.ind rep ay, th^n^ 

cation to properly replay the polanzed digital work „ Jo, ementa rv manner the polarized elements of the 

digital workand the resource information^re^ 

other protection for direct use by any user. nmicallv stored in memory. If the digital work is 

1 512. in the case of a document or other type of digital work whc i requm Mronn 3 

mation, the digital work will include digital content plus '•^J^^^^, lh . d i,t« k l610 

into presentation data 1514. presentation data is sufficient for use by the 

[0160] in some replay applteations, converting the d.grt al ' JTJ converted. For example, in the 
user, in others, presentation data is only an intermediate fonr whch ^nujt b. further conve ^ P rendering 
case of a display system 1524 which is a printer, the P"»£»"^ SCJi^**o •***»»" 1518 
application 1518. Rendering application 1518 may be a ^^^"^^^ 1520 . image data 1520 is 
usL other system resources15^ 

in a form which can be directly displayed on .display ^^^^J ™ « 0 rtc during replay, a digital 
[0161] in addflon to the r^f m ^^'£^^^^^ Polarization scheme which 
work may be protected dunng replay by poizing the Zl^ a1ion A portion of the digital work's resource 

P^po.arUedcont^ Refef , ng * Q Rgure 16 , replay 

information is copied and polarized in Slid aw other system resource information 1616 that 

digital content into a clear .mage (presentahon data). This ; system n a P y y es 
wLhcan beany commercial ^ "T^KJSS digital content, w is the clear 

for any transformation function R, such that R(w£) - R j w /]' * h ^ e J |arize P d resource inf0 rmation. Blind replay of 

- ^r^^^^^rj^^; --^-p^f^g's^ :r^Sr^^ c Fr^^ 

ft. p., M »d «H «* «l Won™*, The — M » 

ized resource information. wiou/ina bv a us r can be used to further 

[0164] Th complexity of rendering a digital work into a usable form for v.ew.ng Dy a us r ca 
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protect the digital work during replay. Referring to Figure 1 7, polarized digital work 1 71 0 is provided to replay application 
1712, which uses polarized system resources 1716 and other system resources 1718 to transform polarized digital 
work 1 71 0 into partially polarized presentation data 1 714. In this embodiment, display system 1728 is needed to trans- 
form presentation data into a form usable by the user. Partially polarized presentation data 1 71 4 is provided to rendering 

5 application 1720 which uses polarized system resources 1716, local system resources 1722 and system resources 
1 71 8 to transform the partially polarized presentation data 1 714 into clear image data 1 724. Clear image data 1 724 is 
then displayed on display device 1726 for use by the user. In this embodiment, presentation data is still polarized, 
taking the location of the clear data to a later point of the display process and providing further protection. 
[0165] To enhance usability of the system for polarization of digital works, the polarized resource information may 

w be separated from the digital work and tied to a transportable device such as a smart card. In this embodiment, the 
replay application 1 71 2 plays back the work using the polarized system resources 1 71 6. Instead of having the polarized 
system resources 1716 stored in a local memory, along with the polarized digital work, 1710, the polarized system 
resources 1 71 6 is stored in a transportable device such as a smart card. Also, the smart card, possibly with hardware- 
enhanced features, may possess attributes that provide for tamper resistance. Within the transportable context, the 

15 polarized data is processed by the replay application 1 712 to yield the partially polarized presentation data and then 
provided to the rendering application 1 720. 

[0166] Many different types of digital works can be protected throughout use using the polarization method. For 
example, if the digital work is a document or text file, the replay application may be a word processor, system resources 
or resource information may include font tables, page layout, and color tables. If the digital work is audio or video data 
20 (e.g., streams), the replay application may be an audio or video player. The presentation data will be the audio/video 
final data stream. The display system may be an audio/video device. The rendering application may be the audio/video 
device driver. The image data may be the audio/video device data stream and the display device may be the audio/ 
video rendering device (speaker or monitor, for example). 

[0167] For a digital work that is an audio/video data stream, the system resources or resource information may 
25 include characteristics of the audio/video device: sample rate (samples per second - e.g., 8 kHz, 44.1kHz), sample 
quality (bits per sample - e.g., 8, 16); sample type (number of channels - e.g., 1 for mono, 2 for stereo), and sample 
format (instructions and data blocks). A table of some audio/video data streams and their corresponding resource 
information or variable parameters which can be selected for polarization is set forth below: 



Table 1: 



Digital Work: A/V Data (Streams) 


Extension 


Origin 


Variable Parameters (# Fixed) 


Compression 


Player 


.mp3 


MPEG standard 


sample rate, quality, #type 


MPEG 


MP3 Player 


.ra 


Real Networks 


sample rate, quality, #type 


Plug-ins 


Real Player 


.wav 


Microsoft 


sample rate, quality, #type 


ADPCM 


Window Media 


.snd 


Apple 


sample rate, #quality, #type 


MACE 


QuickTime 



[0168] The structure of a digital work can be used advantageously for polarization. While it is possible to polarize 
the entire digital work, it is more convenient to polarize only a portion of the digital work. Most digital works include 
three primary elements: instructions, data, and resources. Preferably, only the data and resources of the digital work 
are polarized, much like the format preserving encryption method described above. By selectively transforming only 
the data and resources, a digital work may be transformed such that the content remains in the original format, yet the 
data and resources are incomprehensible. 

[0169] The general layout of a digital work of the document type is shown in Figure 16. in Figure 1 8, digital work 150 
inducer Face Pescriptci U-r. . Gcntrcl Cocef M 16F eric' 162. Resource loer.tifie: 1H . anr! Patfc 160 £nc' 164. The 
Page Descriptors 152 define the general layout oi a work. Foi instance, tne page size, page number anc margins tali 
into the category of Page Descriptors with respect to digital documents. Control Codes 154, 158 and 162 are similar 
in that they describe the presentation of the content. Examples include commands to set text position, output text, set 
font type, and set current screen coordinates. Resource Identifiers 156 simply reference the desired resources. In the 
digital document realm, resources could vary from font typeface to background color. Finally, Data 1 60, 1 64 represent 
the core information communicated by the digital work. This could be the drawing coordinates used in a multimedia 
clip or the character codes for rendering as a digital document. 

[0170] An example of a digital work (in this case a simple digital document) and one of its polarized forms are shown 
in Figures 1 9 and 20, an HTML document in clear and polarized form. The tags <html> and <bocly> are Page Descrip- 
tors. The <font>...<\font> tag is an example of a Control Code for setting font resource characteristics, while "Arial M 
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[0171] Raul, 20 shoi «hal tte <kgM 3,* and <tan> w are inching*. »«« 

RealNetworks RAM, Apple QuickTime, etc. infofma tiom can be thought of as the collection of 

[0172] The system context (or system resources or resource ^ e 7^ lnctot h9FoiitTi*le. 
system resources available to a replay application on a application, the replay 

Color Palette, System Coordinates and Vo lume Se "Z^^Z^SS* trLU the digital content 
application uses the particular resource mformat.cn c ° n *' n ^J^ 

into presentation data. Each system context or resource Ij^^j^^^ element for the use of the 
,o be unique to a system for which it can be "^^^^^^^ application for replay. The 
digital work, tying use of the digital work to a specie * td.rectly reference etements contained 

Resource Identifiers and Data within the d.g.tal work m ^ e '^2^ 
within the system context, ^n^ing the digKa, wo^ 

iResiD), element identifier (EiemlD). and = ^ 

r u 7,o"= 

characteristics for T instead of 'a'. tvn« nf diaital works In addition to docu- 

10176] Relaxation and blind rendering may be used ^^^^^.SSd* datafe genera.^ 
Lts^olanzationandblindrendenng^ 

^ioS^^^^ 

Ly sTmp.ing the audio/video *P«* rea ^^^^ Yearns and then 

video device. It uses the audio/video system resources Jj^Jv^J^,,, in a {ormat expe cted by the target 

m£ rl quaiity. type and format expected by a ^^^^ some hardware system , ha t is abie to 
t017 8, The target audio* i^o de.ce ^ " and format 

convert the audio/v.deo stream (presentation date 1 ny ai » ji ^ ^ _ _^ speaker£ 
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document. The replay application (audio/video player) mixes together the unpolarized stream and the polarized stream, 
and using the polarized r source information, produces a polarized final data stream for the target audio/vid o device 
with a correct set of resource information. The target device (1 720) uses the polarized r source information to play the 
polarized data stream generating clear sound/visual effects (1724). 

5 [0181] While certain exemplary embodiments of the invention have been described in d tail above, it should b 
recognized that other forms, alternatives, modifications, versions and variations of the invention are equally operatjv 
and would be apparent to those skilled in the art. The disclosure is not intended to limit the invention to any particular 
embodiment, and is intended to embrace all such forms, alternatives, modifications, versions and variations. For ex- 
ample, the portions of the invention described above that are described as software components could be implemented 

10 as hardware. Moreover, while certain functional blocks are described herein as separate and independent from each 
other, these functional blocks can be consolidated and performed on a single general-purpose computer, or further 
broken down into sub-functions as recognized in the art. Accordingly, the true scope of the invention is intended to 
cover all alternatives, modifications, and equivalents and should be determined with reference to the claims set forth 
below. 

15 

Claims 

1. A method of protecting a digital work, z, during transformation by a transformation function, F, into presentation 
20 data F(z), comprising: 

encrypting the digital work, z, in accordance with an encryption scheme, E; 

using a blind transformation function P to transform the encrypted digital work E(z) into encrypted presentation 
data, P(E(z)), wherein P is a function of F; and 
25 decrypting the encrypted presentation data, F(E(z)), in accordance with a decryption function, D, to obtain 

the presentation data, F(z), wherein D(F'(E(z)) = F(z). 

2. The method of claim 1 , wherein the encryption scheme E is a format preserving encryption scheme. 

30 3. The method of claim 1 , wherein the encryption function E is an additive encryption scheme and wherein P = F. 

4. The method of claim 3, wherein the additive encryption scheme is selected from the group consisting of Mult, Exp, 
EG, OU, RSA and compositions thereof. 

35 5. The method of claim 1 , wherein P is a polynomial of F. 

6. A system of protecting a digital work, z, during transformation by a transformation function, F, into presentation 
data F(z), comprising: 

40 an encryption engine for encrypting the digital work z in accordance with an encryption scheme, E; 

a blind transformation function P for transforming the encrypted digital work E(z) into encrypted presentation 
data, P(E(z)), wherein F is a function of F; and 

a decryption engine for decrypting the encrypted presentation data, P(E(z)) t in accordance with a decryption 
function, D, to obtain the presentation data, F(z), wherein D(F(E(z)) = F(z). 

45 

7. The system of claim 6. wherein the encryption scheme E is a format preserving encryption scheme. 

l 1 ne system of claim ^ . vitrei:", the encryption lunc-noi. . a tii scriitiv^ encivptior. scheme and wherein F = F. 

so 9. The system of claim 8, wherein the additive encryption scheme is selected from the group consisting of Mult, Exp, 
EG, OU, RSA and compositions thereof. 

10. The system of claim 6, wherein P is a polynomial of R 

55 
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